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Abstract 

We introduce a robust, error-tolerant adaptive training algorithm 
for generalized learning paradigms in high-dimensional superposed 
quantum networks, or adaptive quantum networks. The formalized 
procedure applies standard backpropagation training across a coher- 
. ent ensemble of discrete topological configurations of individual neu- 

QTi ral networks, each of which is formally merged into appropriate linear 

superposition within a predefined, decoherence-free subspace. Quan- 
tum parallelism facilitates simultaneous training and revision of the 
system within this coherent state space, resulting in accelerated con- 
vergence to a stable network attractor under consequent iteration 
of the implemented backpropagation algorithm. Parallel evolution 
of linear superposed networks incorporating backpropagation train- 
ed ' ing provides quantitative, numerical indications for optimization of 
■ both single-neuron activation functions and optimal reconfiguration 
O . of whole-network quantum structure. 

Keywords: Neural networks; Quantum topology; Adaptive learn- 
ing 



1 Introduction 



Artificial neural networks are routinely applied to resolve unstructured or 
multivariate machine learning problems such as high-speed pattern recog- 
nition, image processing and associative pattern matching tasks. Due to 
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their novelty, however, quantum neural networks remain relatively uncharted 
in the artificial intelligence and quantum algorithms communities. Sev- 
eral groups [H |2l O Hj have outlined preliminary quantum network archi- 
tectures; each novel approach contributes significant insights towards appli- 
cation methodology, alternative implementations and underlying modal in- 
terpretation - however, widespread and effective universal implementation of 
quantum neural networks remains an open research question, as both theoret- 
ical and experimental toolsets are still in the incipient stages of development 
and maturation. 

A common underlying thread shared by quantum network proposals to 
date is that each implements superposition of neural transition functions 
upon a single, fixed topological foundation. In this letter, we expand upon 
a novel framework initially introduced in [5J, which diverges from prior net- 
work models by fine-tuning not only optimization of neural transition func- 
tions - but by fully reconfiguring the connective physical topology of the 
quantum network itself. The mathematical formalism employed for this ap- 
proach descends from the Rota algebraic spatialization procedure of evolving 
reticular quantum structures, which was initially developed in [5] to address 
superposed topological manifolds of spacetime foam as described in quantum 
gravity. Subsequently, not only neuron weighting and transition functions - 
but also linear superposition of the network topology itself - are subject to 
training and revision within this coherent state space. 

We formally incorporate the standard backpropagation training algorithm 
initially introduced by Werbos in [7]. Repeated iteration of training series 
results in convergence of sample output to a stable network attractor corre- 
sponding to the lowest energy configuration space between given input and 
desired output layer. Following convergence to this minimum, the superposed 
linear network is converted upon measurement to a conventional, classical 
neural network by consequent application of the Rota algebraic projection 
formalism. 

2 'Linear' Neural Networks — Dual contexts 

Traditional computers execute algorithms - that is, they follow a specific set 
of instructions to arrive at a solution to a given problem. Artificial neural 
networks, by contrast, learn by trial and error: training through example. 
In a specific class of neural networks - multilayered, feedforward neural net- 
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works - signals are allowed to propagate only forward: there is no feedback 
process in the training phase. These connection patterns are formally clas- 
sified as mathematical structures known as posets - partially ordered sets, 

or DIRECTED ACYCLIC GRAPHS - DAGS. 

In this letter, we convert the primary component of artificial neural net- 
works - directed acyclic graphs - into a set of linear matrices. The training 
optimization protocol is then reclassified in terms of unitary matrices and ma- 
trix operations. The result of training optimization is then a matrix, rather 
than a directed acyclic graph. In order to recover the initial graph structure 
upon derivation of the appropriate solution, the Rota algebraic spatialization 
procedure [8] is applied to recover the appropriate graph. 

Herein we use the term adaptive neural networks. However, the primary 
objects subject to revision and training are operators in linear spaces - which 
can be realized, for instance, as quantum observables, rather than as explic- 
itly defined neural networks. In our paper, we shall use the term 'linear' 
under two separate contexts. The first definition implies restriction to lin- 
ear artificial neural networks, as we defer application of nonlinear network 
optimization problems to a subsequent manuscript. In this letter, we focus 
solely upon optimized approach and applications of linear neural networks. 

The second usage of linearization is the standard linear formalism central 
to quantum mechanics. This is the basis of our mathematical approach, and 
is outlined in further detail. The topology of a feedforward artificial neural 
network, Af, is described by the template matrix A of the appropriate 
directed acyclic graph, which is formed as follows: 

A = f *, if j -> k in M 
J 1 0, otherwise 

where * stands for a wildcard - any number, and the set of such numerical 
matrices form an algebra [TO] as it is closed under multiplication. The main 
property of A is that the synaptic weights 'follow' it - namely, if Ajk = 0, 
then Wjk = 0. 

Returning to picture ([2]) of signal propagation in J\f, taking various ma- 
trices A - and allowing them only to comply with the template matrix (JTJ) 
- we form various products ([2]). The resulting set of matrices is called the 
Rota algebra of M. It can be verified that this set is closed under sums 
and matrix products, and thus qualifies as a closed algebra. This description 
is explicated under greater detail in [5]. 
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Given a feedforward neural network M with the set of nodes M, consider 
the linear space 7i, whose basis is labeled by the elements of M. A vector 
x G 7i is associated with a state of the network, namely the j-th component 
of x is the activity in node j. Note that the state of M captures the activities 
of all nodes, so a convention is required to define the default activity value 
of the nodes - which are thus set to zero. 

Initial activation corresponds to changing values of only a part of nodes - 
specifically those in the input layer. The signal then propagates through A/", 
and we describe this activation by a lineailj operator W in Tt. The time in 
which the signal requires to advance is described as the following evolution 
of the state of the simulator: We start with an initial input vector x , thus: 

x xi = x jy !->•... h-> x out = x n = x iy n (2) 

following n steps. Recall that postfix notation is employed in this instance. 

The crucial feature of the suggested approach is that signal propagation is 
now described by iteration of the same operator W . From this step forward, 
the operator W, rather than the collection of weights as in the traditional 
backpropagation approach, will be subject to training. 

3 Performing standard operations in linearized 
form 

In this section, we reformulate classical neuron activity, propagation, and er- 
ror backpropagation methods in linear terms. A first detail to consider is the 
process of signal propagation in terms of Rota algebras. We initially have a 
register - a sequence of unique numbers defining the artificial neural network 
state - and view it as a vector x in the appropriate linear space. Following 
formula (j2J), we consecutively act upon the register, following elements of the 
Rota algebra. Initially, setup is defined as x = 0. Recall that the entries 
of x correspond to activities of a single neuron. When the initial signal is 
input, this results in the fragment of x corresponding to the input layer ac- 
quiring the appropriate values, while the remainder is still set to zero. The 
application of W activates the neurons of the second layer. Following this 
action, the first fragment bears the input values, and the second fragment 

1 Recall that in this paper we focus solely on linear neural networks. 
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corresponds to the net inputs of the second layer. This activation function 
is then applied forward throughout the network. 

The final vector, x n , possesses the following structure, consisting of blocks 
corresponding to layers - the input block contains corresponding input data, 
and the intermediate blocks contain net inputs from the previous layers. This 
captures the property that as the signal propagates through the network, the 
intermediate neurons retain their values. On the other hand, this requirement 
is not essential - within the same model, we may also consider artificial neural 
networks with simultaneous signal inputs at the propagation point of a single 
input pattern. When presented with a given training set, the overall error 
function may be minimized by solving the equation, 

JLAW-T-O (3) 

One disadvantage at this point is that as the dimensionality of the state 
space grows, the exact solution of this equation may become intractable, 
at which point approximation methods such as gradient descent may need 
to be employed. To implement this, an error backpropagation method is 
required. The initial error vector 5 is set to be zero. Just as in ([2]), the 
backpropagation process can then be described. The input, <5 , is defined as 
the difference between the fragment of the output x n of the forthpropagation 
process and the target vector t. The expression for updated errors is the 
product of these two factors: the first factor can be expressed as Wx. n - t - 
The second factor can be expressed as 5tW T , where W T is the transposed 
matrix. In accordance with agreement that the states of M are diagonal 
matrices, rather than vectors, the product of the two factors is well-defined. 
As a result, 

8 t+1 = W^AW T (4) 

The standard weighting adjustment formula can then be written in compact 
form, using the adjacency matrix A of the directed acyclic graph associated 
with our network: 

W u P d = W ™ + $ n . AXn ( 5 ) 

This procedure is then applied recurrently until required error values converge 
to a minimum within predesignated boundary tolerances. It should be noted 
that there is no novel formalism at this early stage of the procedure - this 
step is but a reformulation of a standard algorithm in supervised learning. 
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4 Mathematics of spatialization 



The formalized spatialization procedure developed for adaptive quantum net- 
works is descended from the algebraic description of quantum foam - which 
describes continuously-evolving, superposed topologies of spacetime mani- 
fold fluctuations as calculated under quantum gravity [6]. The key point of 
the spatialization procedure is the following: consider the full matrix algebra 
A as a linear space. From this perspective, the Rota algebras Q described 
by ([I]) are just linear subspaces of A. Having a procedure which - starting 
from a subset of A - creates a topological space, providing the capability to 
discuss superposed configurations of differing neural networks. In this sec- 
tion we present a procedure which - starting from a given subspace of A - 
produces a set, and endows it with a topology that can be associated with 
certain acyclic directed graphs. 

When the set of nodes is fixed, Rota algebras provide a suitable machinery 
to describe topology changes, which are expressed in terms of creation and 
annihilation of edges. In terms of template matrices ([I]), these operations are 
adding or removing asterisks from the appropriate templates. In a fixed basis, 
a Rota algebra is a subspace of a full matrix algebra A - therefore adding or 
removing asterisks is equivalent to adding or removing basis vectors. Thus, 
any particular Rota algebra is a linear subspace of Q. The basic idea of 
our approach is to associate any subspace of Q with an appropriate directed 
acyclic graph. 

Spectra — the emergence of nodes. The notion of a spectrum is tightly 
related with that of an ideal. Suppose again that we have an algebra of 
functions A on a set M. Fix a point m G M, and consider the subset IC^l 
of functions / such that f(m) = 0. First, X itself is an algebra. Furthermore, 
for any / G A and any h G X the product fh is always an element of X. Such 
subsets X are called ideals of the algebra A. Points of M are in one-to-one 
correspondence with the maximal ideals of A^ and M is called spectrum^ 
of A. 

2 A characterization of all two-sided ideals required for spatialization was recently pro- 
vided in [9]. 

3 Note that there may be different spectra for the same algebra. A simple and degenerate 
example is given by the set C of complex numbers. Its spectrum with respect to complex 
numbers consists of one point, Mc = {1}, while its spectrum with respect to real numbers 
consists of two points, Mr = {!,«}. 
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In the case wherein A has infinite dimension, such as in a linear space, 
there are different, non-equivalent and nontrivial topologies which can be de- 
fined on A. There are standard recipes for this step: for example, the Zariski 
topology on the prime spectrum of Q. Unfortunately, on finite-dimensional 
algebras this topology is always discrete - which leaves us no chance to fit 
the requirement of being non-Hausdorff . In terms of graphs, that means that 
the standard recipes can be utilized to create directed arrows. Thus, we are 
compelled to find another topology. For these purposes, the Rota topology 
was suggested in [8]. 

Suppose we are given a finite-dimensional associative - and non-commutative, 
in general - algebra Q. According to standard conceptions and methods 
of modern algebraic geometry, as well as the general algebraic approach to 
physics, we introduce the points of Q as its irreducible representations (IRs). 
The first step of the spatialization procedure is then creating or finding points 
of fl, which will become nodes of the future graph: 

{ points } = { IRs } (6) 

Rota topology. Denote by M the set of points of fl, each of which we 
shall associate with a prime ideal in fl. Consider two points (representations 
of fl) i,j G M and denote by ker i, ker j their kernels. Both of them, being 
kernels of representations, are two-sided ideals in fl, in particular, subsets of 
fl, hence both of the following expressions make sense: 

ker i fl ker j C fl and ker i ■ ker jcO 

The latter denotes the product of subsets of fl: keri ■ ker j = {a e fl \ 3u e 
keri, v € ker j : uv = a}. Since keri, ker j are ideals, we always have the 
inclusion keri • ker j C keri fl ker j, which may be strict or not. Define the 
relation oc on M as follows: 

% oc j if and only if ker % ■ ker j ^ ker % fl ker j (7) 

The Rota topology is then the weakest one in which % oc j implies convergence 
% — ► j of the point i to the point j. Explicitly, the necessary and sufficient 
conditions for i to converge to j in the Rota topology reads: 

% — > j if and only if 3k , . . . ,k t , ■ ■ ■ ,k n \ k = i, k n = j; k t -\ oc k t (8) 
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This operation is called the transitive closure of the relation oc. Note that in 
general, the Rota topology can be defined upon any set of ideals, and it is 
not necessary for Q to be an algebra. Any linear subspace is suited for this 
purpose. 

As outlined in the mathematical formalism, it should be noted that adap- 
tive quantum networks, as coherent quantum metastructures, are not stan- 
dard neural networks. Rather, they are defined as superposed linear spaces, 
a metasystem whose subspaces consist of neural networks - just as a super- 
posed wavefunction possesses an identity distinct from its constituent clas- 
sical counterpart under influence of environmental interaction. In the ideal 
case, only following convergence to the predesignated, target network minima 
under backpropagation training is projection via the spatialization procedure 
applied to convert the metasystem into a classical artificial neural network. 
As graphs are automatically produced as a consequence of this spatializa- 
tion procedure, no ab initio association of states with graphs is requisite to 
implement the model into a candidate physical structure. 

5 Concluding remarks 

We have integrated standard neural network backpropagation training j7] 
into a linear, decoherence-free subspace of superposed quantum network 
topologies [5] - or adaptive quantum networks - to simultaneously optimize 
both neural transition functions and topological network configuration. The 
framework is both robust and error-tolerant against local permutations, and 
as such is ideally suited to applications in rapid pattern matching, signal 
processing, image recognition and associative learning. 

Coherent quantum information processing architectures that are both 
well-suited for implementation of adaptive quantum networks and demon- 
strate promising experimental progress to date include highly-entangled, clus- 
ter state quantum computers suggested by Raussendorf and Briegel [TTJ 
and implement recently by research group headed by Zeilinger [12], high- 
dimensional Josephson junction qubit arrays [13], optical lattice traps in 
Bose-Einstein condensates [14], thin-layer diamond nanofilms [TS], as well as 
novel materials currently under development by research teams active world- 
wide. 

Implementation of the training algorithm requires a relatively limited 
number of physical constraints to be satisfied - high- dimensional superpo- 
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sition, multiple degrees of internal freedom, and sufficient timescale under 
decoherence-free coherent state evolution. Consequently, adaptive quantum 
networks may be implemented both in state-of-the-art quantum technolo- 
gies, as well as unexpectedly demonstrated within naturally-occurring dy- 
namic molecular complexes such as brain microtubes [18] and proteomics 
in molecular biology [T9l l2"Tj . Several natural systems have been observed 
to exhibit interference properties and high degrees of internal freedom - in- 
cluding fullerenes |17] . tetraphenyl-porphyrin [IT], small biomolecules |17j . 
and chlorophyll [19]. Contrary to conventional expections, Briegel et al. 
recently demonstrated persistent, driven entanglement in non-equilibrium 
quantum systems coupled to a hot and noisy environment, in which the 
presence of environmental noise played a constructive role in maintaining 
recurring quantum entanglement [21] . As such, promising candidates in co- 
herent quantum information processing technologies, condensed-matter sys- 
tems, and biophysics are all potential areas of investigation for experimental 
implementation of adaptive quantum networks [161 120] - 

Superposed quantum network topologies - adaptive quantum networks - 
as outlined in this paper, are unique in allowing for simultaneous training 
of both transition functions and optimization of whole-network topological 
structure. Quantitative backpropagation training provides predictive indica- 
tions for optimal reconfiguration of network performance, which can be ap- 
plied towards a broad class of problems currently addressed by classical neural 
networks. Continued progress will focus upon predictive simulation methods 
to determine convergence series in standardized sample classes, complex and 
chaotic attractor states under dynamic entanglement coupling strengths, al- 
ternative training paradigms, and nonlinear dynamics in the presence of en- 
vironmental coupling through partially-induced weak measurements. 
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